XML Schema Containment Checking Based on Semi-implicit Techniques
نویسندگان
چکیده
XML schemas are computer languages defining grammars for XML (Extensible Markup Languages) documents. Containment checking for XML schemas has many applications, and is thus important. Since XML schemas are related to the class of tree regular languages, their containment checking is reduced to the language containment problem for non-deterministic tree automata (NTAs). However, an NTA for a practical XML schema has 10−10 states for which the textbook algorithm based on naive determinization is expensive. Thus we in this paper consider techniques based on BDDs (binary decision diagrams). We used semi-implicit encoding which encodes a set of subsets of states as a BDD, rather than encoding a set of states by it. The experiment on several real-world XML schemas proves that our containment checker can answer problems that cannot be solved by previously known algorithms.
منابع مشابه
Regular Expression Containment with xs:all-like Operators
We discuss containment checking for regular expressions with operators like XML Schema’s xs:all, that is, the operator that allows a sequence of letters to occur in arbitrary order. In this report, we model this operator by &, and give several algorithms of containment checking for regular expressions with &. Our algorithms are based on the notion of the permutability of word languages. Keyword...
متن کاملSpeeding up Xml Querying
This dissertation develops approaches to testing the satisfiability and the containment of XPath queries in the presence of XML Schema definitions in order to speed up XML querying. XML provides a simple yet powerful mechanism for information storage, processing and delivery, and is a widely used standard data format. XPath is a basic language for querying XML data, and is embedded into many W3...
متن کاملSpeeding up XML querying: satisfiability test & containment test of XPath queries in the presence of XML schema definitions
This dissertation develops approaches to testing the satisfiability and the containment of XPath queries in the presence of XML Schema definitions in order to speed up XML querying. XML provides a simple yet powerful mechanism for information storage, processing and delivery, and is a widely used standard data format. XPath is a basic language for querying XML data, and is embedded into many W3...
متن کاملInitial Specification of a Language Extension with Types and Type Checking
Types are a useful mechanism for early error detection and optimization of any programming language. Semi-structured query evaluation as in Xcerpt needs to be able to cope without schema information or types, in contrast to the relational case. Nevertheless, if available, type information enables query optimization well beyond what is possible in its absence. Furthermore, since semi-structured ...
متن کاملResolving Structural Conflicts in the Integration of XML Schemas: A Semantic Approach
While the Internet has facilitated access to information sources, the task of scalable integration of these heterogeneous data sources remains a challenge. The adoption of the eXtensible Markup Language (XML) as the standard for data representation and exchange has led to an increasing number of XML data sources, both native and non-native. Recent integration work has mainly focused on developi...
متن کامل